Random Blur Data Augmentation for Scene Text Recognition

نویسندگان

چکیده

In this paper, we propose to apply data augmentation approaches that provide more diverse training images, thus helping train robust deep models for the Scene Text Recognition (STR) task. The methods are Random Blur Region (RBR) and Units (RBUs). Specifically, first introduce RBR designed STR training, randomly selects a region sets pixels in with an average value. However, when provides various samples STR, it may make ambiguous reduce recognition accuracy. To address above problem, also RBUs divides blur into several units. Note of one unit share same way, can additional readable help models. Extensive experiments on datasets show achieve highly competitive performance. Besides, complementary commonly used techniques.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Random Erasing Data Augmentation

In this paper, we introduce Random Erasing, a new data augmentation method for training the convolutional neural network (CNN). In training, Random Erasing randomly selects a rectangle region in an image and erases its pixels with random values. In this process, training images with various levels of occlusion are generated, which reduces the risk of over-fitting and makes the model robust to o...

متن کامل

Towards Text Recognition in Natural Scene Images

In this paper, we propose a novel methodology for text detection in natural scene images. The proposed methodology is based on an efficient binarization and enhancement technique followed by a suitable connected component analysis procedure. Image binarization successfully processes natural scene images having shadows, non-uniform illumination, low contrast and large signaldependent noise. Conn...

متن کامل

Scene Text Recognition with Bilateral Regression

This paper focuses on improving the recognition of text in images of natural scenes, such as storefront signs or street signs. This is a difficult problem due to lighting conditions, variation in font shape and color, and complex backgrounds. We present a word recognition system that addresses these difficulties using an innovative technique to extract and recognize foreground text in an image....

متن کامل

Improving text recognition by distinguishing scene and overlay text

Video texts are closely related to the content of a video. They provide a valuable source for indexing and interpretation of video data. Text detection and recognition task in images or videos typically distinguished between overlay and scene text. Overlay text is artificially superimposed on the image at the time of editing and scene text is text captured by the recording system. Typically, OC...

متن کامل

ESM-Blur: Handling & rendering blur in 3D tracking and augmentation

The contribution of this paper is two-fold. First, we show how to extend the ESM algorithm to handle motion blur in 3D object tracking. ESM is a powerful algorithm for template matching-based tracking, but it can fail under motion blur. We introduce an image formation model that explicitly considers the possibility of blur, and show it results in a generalization of the original ESM algorithm. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2021

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2021.3117035